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Abstract. The main result of this paper is the existence of an optimal trans- 
port map T between two given measures fi and u, for a cost which considers the 
maximal oscillation of T at scale S, given by uig (T) := sup| ^ \T(x)—T(y)\. 

The minimization of this criterion finds applications in the field of privacy- 
respectful data transmission. The existence proof unfortunately only works in 
dimension one and is based on some monotonicity considerations. 
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1. Introduction and motivations 

Optimal transport problems represent the mathematization of a very natural 
applied question, which is the following: given the initial density of a certain amount 
of mass, and the target density that we want to realize, which is the best possible 
way to displace the mass so as to guarantee a minimal cost? 

Based on an idea dating back to G. Monge (see [9 ), this is formalized through 
a map T : X ^ Y with the property r#/i — v, where ^ and probabilities 
on X and F, respectively, are the two given distribution of mass, represented by 
two measures (the case of densities is retrieved when the measures are absolutely 
continuous). The constraint = v, expressed in terms of the image measure (we 
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recall that T^jj. is a measure defined through T^ijl{A) — /i(r^^(A))), stands for the 
fact that T must "rearrange" the distribution into the new fixed one, v. 

The typical criterion is based on the minimization of the average displacement 

min j \T{x) — x\P dfi{x) 

(we stick here to the euclidean case, where X and Y are subsets of M'^ and T{x) — x 
makes perfect sense, even if much has been said about other cases, in metric spaces, 
for instance). More generally, the criteria that have been studied are of the form 
J c{x,T{x)) dfi{x) for various cost-functions c, adapted to the different applica- 
tions. For this wide class of problems, which is now known to be linked to many 
other branches of mathematics, from fluid mechanics, to mathematical economy, 
differential geometry, functional inequalities and probability, an alternative, con- 
vex, formulation is available thanks to the ideas of L. Kantorovich (see jl]). The 
existence of an optimal T also passes first through this extended formulation. 

These classical problems in optimal transport theory are now a very lively do- 
main in pure and applied mathematics and most of them have already been solved 
or understood. Yet, it appears from some branches of applications, that a new 
generation of transport problems should be investigated, namely the minimization, 
over the same class of transport maps T, of more general costs, also depending 
on the differentiability or continuity properties of T. Let us think to optimization 
problems of the form 

min J \T{x) - x\P dn{x) + J \\/T{x)\^dx, 

where the goal is to find a good transport map, where its regularity also comes into 
play. Many variants of the form / L{x,T{x),'VT{x))dx could be considered, and 
they are already used in some applications, for instance in image processing (see 
|10j ) or shape analysis. Numerical studies on these issues have been performed, and 
not only in the last few years (see which anticipates a lot the current interest 
for this kind of problems in applied mathematics) . 

The difficulties in studying this higher-order problem are somehow different than 
in the usual Kantorovich theory: here the existence of an optimal T is typically 
easier to establish than in the usual theory, but much less is known about the 
characterization of the optimal maps. For instance, once we suppose that at least 
one map T providing a finite value to the energy exists, it is not difficult to prove by 
standard compactness arguments in Sobolev spaces that a minimizer exists. Yet, 
finding it or studying its characterization appears to be more difficult, and [5] gives 
for instance a very partial answer, in the one dimensional and uniform case where 
X C K and /z is the Lebesgue measure over X. 

Other criteria to minimize can easily appear to be meaningful, based for instance 
on the continuity of T instead of its derivatives. If minimizing the Lipschitz constant 
Lip(r) is easy to understand (and a straightforward application of Ascoli-Arzela 
Theorem provides existence of an optimal T), we want here to present a more tricky 
functional. For a fixed positive value S one can consider the following quantity 

ujsiT)= sup \Tix)-Tix')\; 

\x—x' I <(5 

which is merely the modulus of continuity of T evaluated at S. Minimizing lus{T) 
means finding a transport map T which is as continuous as possible, at scale 6. 
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It is interesting to see that this very minimization problem 

min {uJs{T) \ T^fi = ly} 

comes from a precise applied question which has been raised by the computer science 
community, in the framework of privacy-preserving protocols in telecommunications 
(seel). 

Let us try to explain why this precise minimization should play a role in this 
setting, even if the reader could easily imagine other possible applications of this 
optimization problem. Suppose that a high number of users are connected through 
their mobile phone to a service whose goal is to tell them whether their friends (from 
a social network list, for instance) are located or not within a certain fixed distance 
from them. To do so, all mobile phones periodically communicate their position to 
a common server which computes their distances and provides the users with the 
desired information. Yet, for privacy reasons, we do not want the server to know 
the position of each user, even if we want it to be able to compute their distances, 
which could seem difficult to realize. What is currently done to overcome this 
difficulty is that a third object, an external server, randomly chooses an isometry 
of the space (i.e. , a rotation R of the earth surface) and communicates it to the 
mobile phones of the users, but not to the server. The users then communicate their 
rotated position R{x) instead of their position x to the server, which is therefore 
able to compute their mutual distances, without exactly knowing their positions. 

However, this is not satisfactory yet, since if the number of users is large enough, 
the distance-calculator server could see which are the densest regional where most 
of the users are concentrated. It is not difficult to guess that these regions are more 
likely to correspond to Manhattan, Paris, Tokyo, and other strongly urbanized 
points of the Earth, and once one can reconstruct the position of these poles he can 
also reconstruct all the positions. This is a matter of non-uniform density, and this 
problem would not exist if the population on Earth was uniformly spread. Hence, 
an alternative idea could be the following: find a map T (instead of R) transforming 
the given population density /i into a uniform density v; it is clear that T cannot 
be an isometry, but one can look for the map T which gives a minimal distance 
distortion. We cite for instance the work [3] where the distortion is minimized in 
terms - roughly speaking - of the local bi-Lipschitz constant. 

But the true model which is of interest for the privacy community is really the 
minimization of the ujs modulus of continuity. Actually, if S stands for the threshold 
distance the users are interested in, and we set L — lus{T), we get \T{x) — T{y)\ > 
L ^ \x — y\ > 5. This means that all distances that are computed by the server 
to be larger than L correspond for sure to original distances larger than 6. The 
ambiguity stays true only in case the measured distance is smaller than L, but in 
such a case it is still possible to compute in a privacy-respectful way the distances, 
but with a more costly procedure (see [7], where cryptographic techniques are used 
to this aim). This finally means that it would be suitable to minimize L. 



Notice that it is not always necessary that an external server chooses a same random isometry 
for all the users, since an alternative way of proceeding is that each pair of users (actually their 
mobile devices) secretly agree on a randomly chosen isometry of the space that is not known to 
the server. In this case the server only knows mutual distances between users instead of their 
"rotated" positions, but this will be enough for him to make some clusters and compare to the 
distribution of the users in the world, which is well-known, see [2], and the final result would be 
the same. 
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It is also clear that, from the application point of view, it is not really necessary 
to find the optimal map T, but any map with a small value for ujs{T) would be fine. 
But exactly for this purposes it is worthwhile to study the minimization problem 
from a theoretical point of view, so as to find out possible general features of optimal 
maps (provided they exist) which could suggest how to produce "good" maps T. 

This is why the present paper investigates, as a very first step in this research 
direction, the existence of an optimal map. This problem is quite hard since impos- 
ing continuity only at scale S is not enough to give compactness for the minimizing 
sequences. 

On the positive side, as in the most classical traditions in transport problems, 
there exists a Kantorovich version of this problem, which is presented in this paper 
and existence of a minimizer for this extended problem is then proven. But the 
usual strategy consisting in proving that the Kantorovich minimizer actually derives 
from a map T is not easy to implement. 

A solution to this problem is proposed in dimension one, where it is possible 
to add an extra property: it is proven that an optimal map exists, and that it is 
piecewise monotone, where the number of monotonicity changes is at most of the 
order of 1/(5. 

In order to arrive to this result, the paper is organised as follows: Section 2 
presents the key features of the problem we want to solve (the minimization among 
transport maps T) and of its Kantorovich relaxation, including the existence of 
a minimizer for this relaxed problem; Section 3 slowly gets to the existence of 
an optimal transport map starting from the optimal transport plan and applying 
suitable constructions; Section 4 gives an easy but interesting example where the 
optimal map is not monotone, but only piecewise monotone; finally, we describe 
in Section 5 how to handle some mathematical extensions of the problem that we 
preferred not to introduce from the beginning so as to make the paper more readable 
and to concentrate on the main ideas without too many technicalities. 

2. The original problem and its Kantorovich formulation 

Let S he a. positive real number and /i, v two probability meastires on M''. Let 
us denote f2 and fl' the supports Supp(/i) and Supp(t^) of the two measures, and 
suppose for simplicity that they arc compact (see Section 5 for the adaptations to a 
non-compact setting). All the functions that we consider will be defined on f2 (this 
will also be discussed in Section 5). We are interested in minimizing the functional 

{Ms) uJs-.T^ sup \T{x)-T{x')\ 

\x -x'\ <5 
X, x' G Supp(/i) 

where T lies in T(/i, v) :— {T e B{^; VL') : T^ii = p}, the set of transport maps 
from to v. The quantity uis is merely the modulus of continuity of T evaluated 
at i5. 

2.1. Kantorovich formulation. Inspired from the Kantorovich reformulation of 

the initial Monge problem, we can consider the minimization on a wider class of 
objects : probability measures on the product space, or transport plans, instead 
of transport maps. This should guarantee easier existence results and leads us to 

study the relaxed functional 

{Ks) : 7' — > (7<8)7) -esssup{|2/-i/'| : {x,y),{x' ,y') e f2 x O', \x-x'\ < 5} 
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where 7 lies in n(/x, j^) := {7 G V{ft x fl') : (7ri)#7 = /n, (7r2)#7 = This map 
can be expressed as 

where 

/ : (a, b)e{nx n') x{nx n') ^ |7r2(6) - Tr2{a)\lBio,S){Mb) - Ma))- 

Since the above function / is lower semicontinuous, looking at its supremum on 
any set gives the same result as the supremum on the closure of the same set; in 
particular, its essential supremum coincides with the supremum on the support of 
the measure, hence we have 

"^f (7) = sup {\y -y'\: {x, y), {x',y') e Supp(7), |a; - a;'| < ^} . 

In the following, all the topological notions on ■P(n), where O is compact, will re- 
late to the weak-* topology with the identification to the dual of functions vanishing 
at infinity, i.e. Co{fiy which equals C(r2)' for fl is compact. 

The entire paper is devoted to the existence proof of an optimal transport map 
when /i has no atom and O C R. Further effort should still be made to handle the 
multi-dimensional case. 

2.2. Existence of an optimal transport plan. 

Theorem 1. Given two probabilities 12, v on fl and fl' respectively, and 6 > 0, 
there exists a transport plan 7 € n(/i, v) minimizing the cost . 

Proof. Let us take a minimizing sequence 7„ e n(/i, i/) and set r„ = Supp(7„). 
Up to subsequences, we can suppose weak-* convergence 7„ — ^ 7 and Hausdorff 
convergence r„ T. It is clear that we have Supp(7) C F and hence 

(7) < sup - y'\ : (x, y), {x' , y') e F, |x - x'\ < 6} . 

Now, take two arbitrary points {x,y), {x' ,y') G F with |x — x'| < 6. By Haus- 
dorff convergence, it is possible to build two sequences (a;„,7/„), (a;^,y^) G F„ with 
{Xn,yn) {x,y) and {x'^,y!^) — {x',y'). In particular, for n large enough, they 
satisfy |x„ — < 6 and hence |y„ — y'^] < a;|'^(7„), which yields \y — y'\ < 
lim inf„ w|^(7„). Passing to the supremum over the points of F we get u!^{'y) < 
liminf„ a;^(7„). This semicontinuity proves that 7 is optimal. □ 

3. Existence of an optimal transport map 

In the following, we denote by K the minimal value of ujf on n(yLt, f), by 7* an 
arbitrary minimizer for this relaxed problem and by F its support : 

K := min 7* G argminwl*^, F:=Supp7*. 

Notice that the projection 7ri(F) of the support on the first factor oi Q x equals 
Supp(/ti), i.e. Q. 

We shall now state the main theorem of this paper: 

Theorem 2. Let be compact set iuR, n G 'P{fl) and v G 'P(il') two given 
probabilites on them. If /x has no atom, there exists an optimal transport map 
T G T{ij., v) for the cost ojs, where 

uJs:T\ — > sup \T{x) -T{x')\. 

\x - x'\ < 5 
x,x' 
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3.1. Preliminary remarks and definitions. We denote by / and g the two 

functions defined on 7ri(r) = O delimiting the convex hull along y oiT i.e. 

f{x) := inf {y : {x,y) G T} , 
g{x) := sup{j/ : G T) . 

Remark. These functions are Borel measurable and in particular / is lower semi- 
continuous and g is upper semicontinuous. Actually, since T is compact, for every 
X there is y such that {x, y) G F and f{x) = y. If one takes a:„ — > x and y„ — f{xn) 
then there is a subsequence y„j. y ~ liminf„y„. Since T is closed, we get 
(a;,y) 6 T. Then one has f{x) < y and semicontinuity is proven. Upper semiconti- 
nuity for g is completely analogous. 

The strip [/, g] :— {(x, y) G x M : f{x) < y < g{x)} satisfies the following prop- 
erties : 

(Inc) rc[/,g] 

(Opt) yix,y),{x',y')e[f,g], {\x - x'\ < S =^ \y ~ y'\ < K) 

Definition 1. A strip "0] where -0 : £7 — R is said optimal if it satisfies the 
optimality property ( |Opt[ ) , and admissible if it satisfies both the inclusion (jincp 
and optimality ( |Opt[ ) properties. 

The functions / and g have not a priori extra regularity properties than simply 
being lower and upper semicontinuous. The aim is to replace them with more reg- 
ular ones, defining another admissible strip. Once such a nicer strip is constructed, 
we will prove that it contains the graph of a transport map, thus completing the 
proof, because the optimality property guarantees that any transport plan living 
in the strip has minimal cost. 

Since [/,<?] is optimal, we have \f{x) — g{x')\ < K whenever \x — x'\ < 6, yield- 
ing 

(3.1a) /(x)> inf gix')-K, 

\x — x' I <(5 

(3.1b) gix)< sup f{x') + K, 

motivating the next definition. 

Definition 2. For every function : — > R, we define its t and ]. transform^ as 
follows : 

(3.2a) cl)Hx)= sup 0(y), 

y^i1.\y — x\<6 

(3.2b) (jj^ix) = inf (l){y). 

By definition of the transforms, the strips [/, -|- K] and [g^ — K, g] are optimal. 
Moreover, rewriting (|3.ip . one gets g^ — K<f<g<f'^ + K, meaning that and 
g^ are respectively the largest and smallest functions such that the strips [/, /"I" + K] 
and [g-^ — K, g] are optimal. In particular, they also keep satisfying the inclusion 
property. This is stated in the next proposition. 



These transforms also depend on the domain Q which is used to define the sup and the inf, 
but we will omit this dependence, which will be implicit throughout the paper, thus avoiding 
writing as (f)^'^ and similar heavy notations. 
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Proposition 3. For all optimal strip [0,')/'] where 0, ?/) : i7 — > M, 0^ and cj)^ are 
respectively the largest and smallest functions such that [(j>, (f)^ + K] and [tp^ — K, ip] 
are optimal. Therefore, if [0, ip] is admissible, then these strips are also admissible. 

So far, we have shown that the admissible strip [/, g] can be replaced by [/, + 
K], which is still admissible. Then it can be replaced by [/^~'',/^ + K], since 
(/^ + K)^ — K = /''"■'-, thus enlarging the strip twice. One could wonder if this 
construction should go on, and the answer is negative, as a consequence of the 
following proposition. 

Proposition 4. For all cj) : fl ^ R, 

Proof. Since 0^-'- is the lowest possible, (f>^^ < cj), hence (j)^^'^ < 0^ by monotonicity 
of the transforms. As for the converse inequality, \/y £ B{x,6), 

0ti(y) ^ sup (p'^iz) > (t>^{x) 
zeBiv.S) 

so that by taking the infimum over B(x, 6), 

(p'^-^Hx) > (P^x). a 

This shows that the strip [f^^^, f"^ + K], which is admissible, cannot be enlarged 
without losing the optimality condition. We will see that the t and J, transforms 
have regularizing properties which justify the replacement of [/, g] by [/^~'', + K]. 

3.2. The proof. We shall now study further properties of the transform operations 
defined previously and prove a few lemmas which will be useful in the final proof 
at the end of this section. 

Definition 3. Let (t),ij; : il ^ W. We say that (cj), ip) is a conjugate pair \i (f)^ = -tj) 
et ip^ = (f). 

Remark. By Proposition |4l [(f)^^,(f)^) is a conjugate pair. 

Proposition 5. Let (/> : — > K a Borel function. Then, (j)^ and (p^ are regulated 
function^, hence continuous outside a countable set. Moreover they are respectively 
lower and upper semicontinuous. 

Proof. Since </> is measurable and bounded, it can be expressed as the uniform limit 
of simple functions 0„ . Now let us see how the t and J, transforms act on simple 
functions. Let 4> a simple function 

N 

(j) — UilAi where ai < ■ • ■ < a^. 

i=l 

We shall calculate its t transform (f>^. Since its value at some point x is defined by 
the infimum of cp on B{x, S), (p^ is equal to ai at any point which is distant from 



■^A linear combination of characteristic functions of measurable sets will be called a simple 
function, and if these sets are intervals we call it a step function. A regulated function is by 
definition a uniform limit of step functions. This notion coincides, by the way, with that of 
functions having left and right-sided limits at every point. Note that we work here with functions 
defined on fi, where intervals are traces on Q of intervals of R. Therefore, regulated functions on 
O are exactly restrictions on Q of regulated functions on R. 
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Ai by less than S, i.e. on the set n A| where Al := Ai + -6(0, 6). It is a disjoint 
reunion of intervals of length at least 26, hence the reunion is finite. A similar 
reasoning tells us that {(f)^)^^{a2) is equal to the trace of A| := {A2 + -8(0, 6)) \ A[ 
on ft, and more generally that (j)^ equals on n Aj, where 

4 := {Ak + B{0,5))\(^ljA{^ 

which is a finite and disjoint reunion of intervals. Furthermore, since (j) is supposed 
to be the limit of (^n)n and by monotonicity of the t transform, if \\^n — (t>\\^ < e 
then 

^-£<(t)n<^ + s hence {(j) - s)^ = cj)^ - s < (1)1 < {(f) + = (j)^ + s. 

It proves that (f)^ is the uniform limit of (f>1^, which are step functions : it is a 
regulated function. It is a classical issue to see that the set of discontinuity points 
of any regulated function is at most countable. The same result naturally holds for 
the 4, transform of 0. 

Now we shall prove that c/)^ is upper semicontinuous. Consider a sequence {xn)n 
converging to x. Since (f)^ = inf 5( . 5) </>, if a G B{x, S) then for n large enough 
\a — Xn\ < S, i.e. a € B{xn,S), and (f)^{xn) < <P{a)- This implies that lim (a;„) < 
(/)(a) for all a S B{x,S), yielding 

\im(j)^{xn) < inf ^(a) = 0^(x) 

aeB{x,S) 

which proves the upper semicontinuity of (j)^. Since (f>^ = —{—(j))^, we get lower 
semicontinuity for (f)^. □ 

In the following, all intervals, possibly given in the form \a,b\ where | is either [ 
or ], will be intervals of O. 

Lemma 6. Let (p,ip : —^R conjugate step functions, ip — cp'^' being expressed as 

j=l,...,N 

where the Ij 's are intervals partitioning and sorted in increasing order, and the 
Uj are consecutively distinct. If (ak-i,ak,OLk+i) is a triple such that ak-i > otk, 
ak+i > ctk (we say that Ik is a floor), then the distance between Ik+i and Ik-i, 
defined as milk+i — sup/jj^i (which could he larger than the measure of Ik, or 
even of its diameter, if f2 is disconnected) is at least 25. The symmetric result on 
ceilings of (p holds. 

Proof. By contradiction, assume that the distance between Ik+i and Ik-i is smaller 
than 26. Pick a point x in Ik and a G f2 such that d{a,x) < 6. The ball B{a,6) 
contains the point x € Ik but is wider than the distance between the two intervals 
Ik+i and Ik-i- hence it intersects either Ik-i or Ik+i at a certain point b. Knowing 
that {^jip) is conjugate, it follows that (f){a), which equals 'ip^{a) = supg(^^ g-^'ip, is 
greater or equal than ip{b), hence 0(a) > mm{ak-i, ak+i). This is true for all a in 
B{x, 6), so that (p > min(afc_i, ak+i) on finB{x, 6), and ip{x) > min(afc_i, ak+i) > 
ak for ip{x) = ^'^{x) = mfB(a:,5) 4'- This cannot be true. □ 

This result will allow us to control the amount of monotonicity changes of / and 
g uniformly, i.e. in terms of f2 and 6 and independently of the functions / and g. 
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Corollary 7. The number of floors Fi < . . . < Fn of is bounded by a constant 
M(fl,S), independent from 4>,ijj. Moreover, ip is nondecreasing then decreasing on 
each interval between floors, namely it increases on Gi, . . . , Gm+i md decreases on 
Hi, . . . , i/AT+i where 

Gi < Hi < Fi < G2 < H2 < • • ■ < F]\j < Gn+1 < ^7V+i 
is a subdivision of in 3N + 2 intervals. A symmetric statement holds for <j). 

Proof. We use the same notations as in Lemma [S] : 

j = l....,N 

where the /j's are sorted in increasing order and the a^'s are consecutively distinct. 
We have shown that floors separate their adjacent steps by a distance of at least 
25. Since 57 is bounded, say it has length L, they cannot be more than 

Now, the union of the intervals between floors (or reaching an endpoint of 57) can 
be cut into two parts, -0 being nondecreasing on the left one, then decreasing on 
the right one, because floors separate higher neighbouring steps by definition. The 
number of intervals in this subdivision is bounded from above by 

^ +2=:M'{^,5). □ 

Now, we shall extend this result from conjugate step functions to conjugate Borel 
functions. 

Lemma 8. Let (j),^ : ^M. conjugate Borel functions. Then there exists a cover 
of $7 by intervals 

Gi < Hi < Fi < G2 < H2 < • • ■ < F]\j < Gn+1 < ^w+i 

sorted in increasing order, such that N < M{Q,d), ip is nondecreasing on each Gi, 
nonincreasing on each Hi, and constant on each Fi. Moreover, d{Hj,G.j+i) > 26 
for all j . 

Proof. Let {(j)n)n a sequence of simple functions converging uniformly to (j). We have 
already shown that (pl^ converges uniformly to (p'^ = ip and (p}^^ converges uniformly 
to (p^^. Therefore, up to some renaming, we may assume that (0„, ipn) is a conjugate 
pair of step functions such that {(pn,ipn) — > V') uniformly. Then for all n G N, 
let us take (i^fe)fe=i,...,Ar„, (G^)fc=i,...,jv„+i, (-fffc )fc=i,...,w„+i as in Corollary [71 Up 
to extraction, Nn being a sequence of integers bounded by M = M{rt,6), we may 
assume that it is constant and equal to some N. So far we have 

< i?r < F^ <G'i<...<F^< G-%+1 < H^+i 

such that ipn is nondecreasing on the G^'s, decreasing on the HJ^^s and constant on 
the f^'s. For each fc = 1, . . . , A^, by further extraction, we ensure that the endpoints 
of all these intervals converge monotonically when n — 00, which is possible since 
57 is compact. Let us denote by {Gi)i, {Hi)i and {Fi)i the limit intervals 

= lim G", H,= lim fff , F, = lim F^, 

in the sense of point set limits (these are well defined by the monotone convergence 
of their endpoints). If we ignore the endpoints of each limit interval Fi, Gi and Hi, 
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it is easy to check that these sets give a partition of fl and that cj) and ip wiU keep 
the same monotonicity behavior of and tpn on the interior of each intervaL □ 

Lemma 9. If (j)^^ are conjugate Borel functions, there exists a finite subdivision 
of r2 into intervals such that they are of same monotonicity on each interval of the 
subdivision. 

Proof. Take the subdivision 

Gi<Hi<Fi<G2<...<Fn< Gn+1 < Hn+1 

given by Lemma |S1 where each interval is a monotonicity or constancy interval for 
-0. Each floor interval Fi may be divided into two parts F^ (the first half of the 
interval) and F^ (the second). More precisely, we define F^ — i^ifl] — oo, mi] and 
F^ = [?7ii,+oo[, where :— (inf G^+i +supiJi)/2 is the middle point between 
the adjacent endpoints of the two intervals next to Fi. In this way both F~ and 
F^ are at least 5 long (in the sense that — swpHi and inf G^+i — rrii are at least 
5) and moreover is nondecreasing on each interval of the form F^ U G^+i (and 
on the first interval Gi) and nondecreasing on each Hi U F~ (and on Hn+i)- 

It is not difficult to check that (p has the same monotonicity of ip on these 
intervals. Let us consider for instance the case of F^ U G^+i, where ■0 is nonde- 
creasing. Let us denote by a and h its endpoints, i.e. a — inf .F^'^, b — supGi+i and 
F^ U G,:+i = [a, b] n Vl. Consider that -0 is also nondecreasing (actually, constant) 
on [a — (5, a] , since this segment is included in F~ : this implies that (j) = ip^ is, 
nondecreasing as well on the interval [a, b — S\ (as a consequence of the fact that the 
behavior of (f> on an interval only depends on the behavior of on the same interval 
enlarged by 6). We are only left to prove that (p is also nondecreasing on ]6 — 5, 6] 
but this is easy to check since (p is actually constant on this segment. Indeed, the 
value Tpib) is the maximum of ip on [6 — (5, 6 + (5], which implies that cp is constant 
on]&-(5, 6 + (5[. 

An analogous proof works for the intervals Hi U Fp^ and for Gi and Hj^^i. □ 

This last result will be the key point in the proof of the existence of an optimal 
map, since the following lemma allows for building transport maps which are in- 
cluded in a given strip, provided the boundaries of the strip are given by functions 
with the same monotonicity. 

Lemma 10. Consider two probabilities /i G 'P(f^) and v G 'P(ri') and two Borel 
functions (p,ip : fl Q with the same monotonicity and such that <p < ip. If there 
exist 7 G n(/i, z^),r e T(/i, i/) .such that Supp7 C [0, V'] ^'^'^ .such that T has the 
same monotonicity as (p and ip, then the graph ofT is ^-almost everywhere included 
in [<p, Ip] . 

Proof. Let a be such that for some xq we have T{xq) < a < 4'{xq). One has 
li{x : T{x) < a) = v{y : y < a) and hence j{{x,y) : T{x) < a) — ^{{x,y) : y < a). 
By subtracting the same quantity "f{{x, y) : T{x) < a, y < a) to these two measures 
we get 

7((a;, y) ■■ T{x) <a <y) = -f{{x, y) : y < a < T{x)). 
But for {x, y) to be in the right-hand side set, since T{xq) < a, T is nondecreasing 
and a < T(x), one must have x > Xg. Hence (p{x) > (pi^o) > o, since (p is also 
nondecreasing. Therefore 

7 {{x, y) -.y <a< T{x)) < 7 ((x, y) : y < a < (p{x)) 
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which is null for 7 is concentrated on F C [(f>, tjj]. As a result 7((x, y) : T{x) < a < 
y) = and 'j{{x,y) : T{x) < a < (p{x) < y) = as well by inclusion. Notice that 
the condition < y is useless here, since 7 is concentrated on V"]; ^^nd this 
also gives 

^{x : T{x) < a < 0(x)) = ^{{x, y) : T{x) < a < <j){x)) 

^-f{{x,y):T{x)<a<(t){x)<y)^Q. 

This means that for all a such that T{xq) < a < (t>{xo) for some xo, 

fi{x : T{x) <a< (f>{x)) = 0. 

Taking a countable dense set of such a's, this yields 

fi{x : T{x) < (f>{x)) = 0. 

An analogous proof provides T < ip /i— a.e. □ 

We are now ready to prove the main theorem of this paper. 

Proof of Theorem\^ We replace / and g by /^~'- and + K . Lemma [5] shows that 
one can find a finite subdivision of il into intervals f^i, . . . , JIat on which / and 
g are of same monotonicity. Let us set 7* = 7|f2.xn' ~ ""1(7^)1 ^3 — ''^^ilj) 
and fj = f\n., g.j 51^^ so that 7* e Ii{iij,Vj) is supported in [fj,gj\. Since the 
/ij's have no atom, it is a classical result that there exists a unique transport map 
Tj £ T{fij,Vj) with the same monotonicity of / and g. Since Supp7j C [/j,<7j], 
Lemma[TU] guarantees that Tj has its graph /i-almost everywhere included in [fj,gj]- 
We shall naturally glue these Tj's together, posing T{x) = Tj{x) on Vlj. It is clear 
that T S T(/i, i^) and that Supp7T C [/, ff] which implies by optimality of [/, 5] 
that uJs{T) — K. It is an optimal transport map for the cost ojs. □ 



We finish this analysis with an easy counter-example, showing that the optimal 
transport map T is not always monotone (which would trivialize the interest of 
the previous existence results). This example is essentially due to J. Louet ([5J), 
who found it for another variational problem. Yet, it can be easily adapted to our 
scopes. 

Consider the map U : [0, 1] — > [0, 1] given by 



Consider /i = f{x)dx a probability measure on [0, 1] given by the density /: 



4. A COUNTER-EXAMPLE 





Take v = U^fi. It is not difficult to check that 1/ is supported on [0,1], and is 
absolutely continuous with density g 
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Also, one can compute the unique monotone increasing map T such that = u. 
Its expression is 




if a; e [0, i] , 
if a; e [i,f] , 

if a; e [f ^1 



,i + 4(x-|) ifxe[|,i]. 

The unique monotone decreasing map is simply symmetric to T, due to the sym- 
metry of the starting measure /x. 

Consider now (5 < |. It is easy to check that we have ujs{T) — 46 (a slope of 4 
is realized in the last interval [|, l] , whose length is |), while ws{U) ~ 2S (since U 
has always slope 2) . 

This proves that, for these given choices of /i, v and 6, the optimal map cannot 
be T (it does not prove on the contrary that the optimal map is U). 





5. Technical extensions 

For the sake of simplicity, we tried to describe our problem sticking to the easiest 
case. For instance, we assumed both Supp(/i) and Supp(i') to be compact, which 
simplified some proofs. 

Also, we used the same set fl both as the support of fi and as the domain where 
the functions are defined and the oscillation lus is computed. Indeed, if we take 
a measure which is not fully supported on a set fi, we could face two reasonable 
choices for the functional a;^ , since one could take the supremum over pairs of point 
x,x' G Supp(/x) with \x — x'\ < (5, or more generally over x,x' G fl with \x — x'\ < S. 
We chose the first definition, which is easier to handle and corresponds more to the 
application we had in mind. 

Yet, it is true that the behavior of T on $7 \ Supp(//) does not affect the image 
measure constraint, but it could affect the value of ujs if the second definition is 
chosen, thus penalizing big jumps of T between different connected components 
of Supp(/i). The problem is that this second definition makes it more difficult to 
define a Kantorovich approach and we need to slightly change our functional Wg^ . 

5.1. Optimal plans and maps in the non-compact case. Few adaptations 
have to be performed for the existence of an optimal plan if /i, v are not compactly 
supported. 
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First, let us notice that in this case the existence of a plan 7 £ n(/i, i') such that 
utg^ {■~f) < +00 is not straightforward (unless Supp(i^) is compact) and has to be 
supposed. This fact more or less corresponds to the fact that j/ has a queue which 
is comparable to (or smaller) than that of fi, i.e. that there exists a constant k such 
that v{B'f,^) < fJ.{B'f) for large r. Anyway, let us assume that mi{Ks) < +00. 

It is standard and well-known in optimal transport (see for instance [11]) that 
for given fi, v the set n(/x, v) is a tight subset of Vi^l x SX). This still allows for the 
extraction of a weakly converging subsequence 7„ ^ 7. 

Obviously we cannot take a subsequence such that r„ Hausdorff converges to 
r since this would require the domain to be compact. Yet, we can easily, by a 
diagonal argument, extract a subsequence such that for every natural integer i? we 
have r„ n (S^ X ;b^) Tij C X B^. 

Since we can write wj^ = sup^ '^^ri where 

wffl,(7) := sup{|y-?/'| : (x, y), (a;', y') £ Supp(7) n (S^ x S^) , \x-x'\ < S} , 

we can infer the semicontinuity of ou^^ from that of each ujfj^, which can be proven 
using Supp(7) n {Br x Br) C and applying the same arguments as above. 

Once the existence of an optimal plan is established, one needs to adapt the 
content of Section 3 to the case where fl is non-compact. This is not difficult once 
we notice that the support F of an optimal plan 7 must be "locally bounded" in the 
following sense. 

If we suppose that the minimum of oj^ is finite, i.e. K < +00, then every vertical 
fiber {y : (x, y) G F} has diameter bounded above by K (since if (x, y) and (x, y') 
belong to F, then we should have \y — y'\ < K due to |a; — a;| = < 5). This proves 
that / and g are well defined. Not only, for every x f is bounded above by f{x)+K 
on the whole ball B{x, S), which allows, by recursively applying this bound, to say 
that / is locally bounded. Analogous considerations hold for g. In particular, the 
intersection of the support of F with vertical strips of the form Br x R are bounded 
and hence compact. 

The only extra point to remark in order to perform the same analysis on M 
is that we will have no more a finite number of intervals: when we approximate 
(Proposition [5]) (j) and ip with step functions this will be done with functions which 
are constant on a countable (but locally finite) number of intervals; the sum in 
Lemma |6] will be no longer finite but locally finite, and the bounds on the number 
of intervals appearing in Corollary [7] and on will only be local. Yet, the main points 
of the proof will stay the same, since they are essentially local. 

5.2. Wider definition of uig- If one considers a measure fi which is not fully 
supported on Q but wants to define a;^ in the following way 

MT):^ sup \Tix)-T{x% 

x.x'^^i. \x — x'\<S 

then the definition of the functional oj^ has to be changed, since that of Section 2.1 
only considers pairs (x, y) and {x',y') in the support of 7, so that x,x' G Supp(/z). 
A possible way to overcome the problem is the following: define 

c.f(7) :-inf{sup{|2/-y'| : (x, y), (x', y') e F, |x-x'|<<5} : Fg^(7)}, 

where 

Ai-f) :={TcnxR : F D Supp(7), 7ri(F) = ft}. 
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This means that, instead of computing a maximal oscihation on the support 
of 7, we compute it on sets which are extensions of this support, but have full 
projection onto fl, and we chose the best possible extension. Again, this functional 
only depends on Supp(7), as in Section 2.1. 

The existence of an optimal 7 both in the compact or non-compact case easily 
follows from the same considerations, simply replacing the support with this set F 
(in a minimizing sequence, take a sequence of sets r„, make it converge to a set F, 
which will contain the support of the limit measure. . . ) . 

Moreover, for any fixed measure 7 it is straightforward that an optimal set F 
does exist (same argument), and we will use this set to define the functions / and 
g of Section 3.1. The rest of the construction is exactly the same. 
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